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GROWTH DIFFERENTIATION FACTO R-7 
BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The invention relates generally to growth factors and specifically to a new 
member of the transforming growth factor beta (TGF-/3) superfamily, which is 
denoted, growth differentiation factor-7 (GDF-7). 

2. Description of Related Art 

The transforming growth factor fi (TGF-/J) superfamily encompasses a group 
of structurally-related proteins which affect a wide range of differentiation 
processes during embryonic development The family includes, Mullerian 
inhibiting substance (MIS), which is required for normal male sex development 
(Behringer, ef a/., Nature, 345:167, 1990), Drosophila decapentaplegic (DPP) 
gene product, which is required for dorsal-ventral axis formation and 
morphogenesis of the imaginai disks (Padgett, era/., Nature, 325:81-84, 1987), 
the Xenopus Vg-1 gene product, which localizes to the vegetal pole of eggs 
((Weeks, era/., Cell, 51:861-867, 1987), the activins (Mason, etai, Biochem, 
Biophys. Res. Commun., 13£:957-964, 1986), which can induce the formation 
of mesoderm and anterior structures in Xenopus embryos (Thomsen, ef a/., 
Cell, 63:485, 1990). and the bone morphogenetic proteins (BMPs, osteogenin. 
OP-1) which can induce de novo cartilage and bone formation (Sampath, et 
a/., J. Biol. Chem., 265:13198, 1990). The TGF-^s can influence a variety of 
differentiation processes, including adipogenesis, myogenesis. chondrogenesis. 
hematopoiesis, and epithelial cell differentiation (for review, see Massague, Cell 
49:437. 1987). 
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The proteins of the TGF-/J family are initially synthesized as a large precursor 
protein which subsequently undergoes proteolytic cleavage at a cluster of basic 
residues approximately 110-140 amino acids from the C-terminus. The C- 
terminal regions, or mature regions, of the proteins are all structurally related 
5 and the different family members can be classified into distinct subgroups 
based on the extent of their homology. Although the homologies within 
particular subgroups range from 70% to 90% amino acid sequence identity, the 
homologies between subgroups are significantly lower, generally ranging from 
only 20% to 50%. In each case, the active species appears to be a disulfide- 

10 linked dimer of C-terminal fragments. Studies have shown that when the pro- 
region of a member of the TGF-0 family is coexpressed with a mature region 
of another member of the TGF-p family, intracellular dimerization and secretion 
of biologically active homodimers occur (Gray, A., and Maston, A., Science, 
247:1328. 1990). Additional studies by Hammonds, et a/.. (Molec. Endocrin. 

15 5:1 49. 1 991) showed that the use of the BMP-2 pro-region combined with the 
BMP-4 mature region led to dramatically improved expression of mature BMP- 
4. For most of the family members that have been studied, the homodimeric 
species has been found to be biologically active, but for other family members, 
like the inhibins (Ling, ef a/., Nature, 321:779, 1986) and the TGF-/9S (Cheifetz, 

20 et al., Cell, 48:409, 1987), heterodimers have also been detected, and these 
appear to have different biological properties than the respective homodimers. 

<• 

Identification of new factors that are tissue-specific in their expression pattern 
will provide a greater understanding of that tissue's development and function. 
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SUMMARY OF THE INVENTION 

The present invention provides a cell growth and differentiation factor, GDF-7, 
a polynucleotide sequence which encodes the factor, and antibodies which are 
immunoreactive with the factor. This factor appears to relate to various cell 
5 proliferative disorders, especially those involving neural tissue. 

Thus, in one embodiment, the invention provides a method for detecting a cell 
proliferative disorder of neural origin and which is associated with GDF-7. In 
another embodiment, the invention provides a method for treating a cell 
proliferative disorder by suppressing or enhancing GDF-7 activity. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows RNase protection assays detecting expression of GDF-7 
mRNA in neural tissue and cell lines. The arrow denotes the position of the 
protected species. 

5 FIGURE 2 shows nucleotide and predicted amino acid sequences of murine 
GDF-7. The putative pentabasic processing sites in the murine sequence is 
boxed. 

FIGURE 3 shows the alignment of the C-terminal sequences of GDF-7 with 
other members of the TGF-/3 superfamiiy. The conserved cysteine residues are 
10 boxed. Dashes denote gaps introduced in order to maximize alignment 

FIGURE 4 shows amino acid homologies among different members of the TGF- 
p superfamiiy. Numbers represent percent amino acid identities between each 
pair calculated from the first conserved cysteine to the C-terminus. Boxes 
represent homologies among highly-related members within particular 
15 subgroups. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a growth and differentiation factor, GDF-7 and 
a polynucleotide sequence encoding GDF-7. GDF-7 is expressed in neural 
tissue. In one embodiment, the invention provides a method for detection of 
5 a cell proliferative disorder of neural origin which is associated with GDF-7 
expression. In another embodiment, the invention provides a method for 
treating a cell proliferative disorder by using an agent which suppresses or 
enhances GDF-7 activity. 

The TGF-/J superfamily consists of multifunctional polypeptides that control 
1 0 proliferation, differentiation, and other functions in many cell types. Many of the 
peptides have regulatory, both positive and negative, effects on other peptide 
growth factors. The structural homology between the GDF-7 protein of this 
invention and the members of the TGF-fi family, indicates that GDF-7 is a new 
member of the family of growth and differentiation factors. Based on the 
1 5 known activities of many of the other members, it can be expected that GDF-7 
will also possess biological activities that will make it useful as a diagnostic and 
therapeutic reagent. 

In particular, certain members of this superfamily have expression patterns or 
possess activities that relate to the function of the nervous system. For 

20 example, one family member, namely GDNF. has been shown to be a potent 
neurotrophic factor that can promote the survival of dopaminergic neurons (Lin, 
et al.. Science, 2§0:1 130). Another family member, namely dorsalin, is capable 
of promoting the differentiation of neural crest cells (Baster, era/., Cell, Z3.:687). 
The inhibins and activins have been shown to be expressed in the brain 

25 (Meunier, et a/., Proc. Natl. Acad. ScL, USA, g§:247. 1988; Sawchenko. et al., 
Nature, 334:615. 1988). and activin has been shown to be capable of 
functioning as a nerve cell survival molecule (Schubert, ef al., Nature, 344:868. 
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1990). Another family member, namely, GDF-1, is nervous system-specific in 
its expression pattern (Lee, Proc. Natl. Acad. Sci., USA, 88:4250. 1991). and 
certain other family members, such as Vgr-1 (Lyons, ef a/., Proc. Natl Acad. 
Sci., USA, 86:4554. 1989; Jones, ef al., Development, jm:531, 1991), OP-1 
5 (Ozkaynak, ef al., J. Biol. Chem., 267:25220, 1992), and BMP-4 (Jones, et at., 
Development, 1_U:531, 1991). are also known to be .expressed in the nervous 
system. By analogy GDF-7 may have applications in the treatment of 
neurodegenerative diseases or in maintaining cells or tissues in culture prior 
to transplantation. 

10 Several members of the TGF-0 superfamily possess activities suggesting 
possible applications for the treatment of cell proliferative disorders, such as 
cancer. In particular. TGF-p has been shown to be potent growth inhibitor for 
a variety of cell types (Massague, Cell, 49:437. 1987), MIS has been shown to 
inhibit the growth of human endometrial carcinoma tumors in nude mice 

15 (Donahoe. ef al., Ann. Surg., 19.4:472, 1981), and inhibin a has been shown to 
suppress the development of tumors both in the ovary and in the testis 
(Matzuk. et al., Nature, 360:313, 1992). GDF-7 may have a similar activity and 
may therefore be useful as an anti-proliferative agent, such as for the treatment 
of tumors of neural origin. 

20 Many of the members of the TGF-p family are also important mediators of 
tissue repair. TGF-0 has been shown to have marked effects on the formation 
of collagen and caues of striking angiogenic response in teh newborn mouse 
(Roberts, et al., Proc. Natl. Acad. Sci., USA, 83:4167, 1986). the BMP's can 
induce new bone growth and are effective for the treatment of fractures and 

25 other skeletal defects (Glowacki. et al., Lancet, 1:959. 1981; Ferguson, et al., 
Clin. Orthoped. Relat. Res., 227:265. 1988; Johnson, ef al., Clin Orthoped. 
Relat. Res., 230:257. 1988). GDF-7 may have similar activities and may be 
useful in repair of tissue injury caused by trauma or burns for example. 
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The term "substantially pure" as used herein refers to GDF-7 which is 
substantially free of other proteins, lipids, carbohydrates or other materials with 
which it is naturally associated. One skilled in the art can purify GDF-7 using 
standard techniques for protein purification. The substantially pure polypeptide 
will yield a single major band on a non-reducing polyacrylamide gel. The purity 
of the GDF-7 polypeptide can also be determined by amino-terminal amino 
acid sequence analysis. GDF-7 polypeptide includes functional fragments of 
the polypeptide, as long as the activity of GDF-7 remains. Smaller peptides 
containing the biological activity of GDF-7 are included in the invention. 

The invention provides polynucleotides encoding the GDF-7 protein. These 
polynucleotides include DNA, cDNA and RNA sequences which encode GDF-7. 
It is understood that all polynucleotides encoding all or a portion of GDF-7 are 
also included herein, as long as they encode a polypeptide with GDF-7 activity. 
Such polynucleotides include naturally occurring, synthetic, and intentionally 
manipulated polynucleotides. For example, GDF-7 polynucleotide may be 
subjected to site-directed mutagenesis. The polynucleotide sequence for GDF- 
7 also includes antisense sequences. The polynucleotides of the invention 
include sequences that are degenerate as a result of the genetic code. There 
are 20 natural amino acids, most of which are specified by more than one 
codon. Therefore, all degenerate nucleotide sequences are included in the 
invention as long as the amino acid sequence of GDF-7 polypeptide encoded 
by the nucleotide sequence is functionally unchanged. 

Specifically disclosed herein is a genomic DNA sequence containing a portion 
of the GDF-7 gene. The sequence contains an open reading frame 
corresponding to the predicted C-terminal region of the GDF-7 precursor 
protein. The encoded polypeptide is predicted to contain a potential 
pentabasic proteolytic processing site. Cleavage of the precursor at this 
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site would generate a mature biologically active C-terminal fragment of 146 
amino acids with a predicted molecular weight of approximately 14.900. 

The C-terminal region of GDF-7 following the putative proteolytic processing 
site shows significant homology to the known members of the TGF-p 
superfamily. The GDF-7 sequence contains most of the residues that are 
highly conserved in other family members (see Figure 3). Among the known 
family members, GDF-7 is most homologous to BMP-2 and BMP-4 (57% 
sequence identity) (see Figure 4). 

Minor modifications of the recombinant GDF-7 primary amino acid sequence 
may result in proteins which have substantially equivalent activity as compared 
to the GDF-7 polypeptide described herein. Such modifications may be 
deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the 
polypeptides produced by these modifications are included herein as long as 
the biological activity of GDF-7 still exists. Further, deletion of one or more 
amino acids can also result in a modification of the structure of the resultant 
molecule without significantly altering its biological activity. This can lead to the 
development of a smaller active molecule which would have broader utility. For 
example, one can. remove amino or carboxy terminal amino acids which are 
not required for GDF-7 biological activity. 

The nucleotide sequence encoding the GDF-7 polypeptide of the invention 
includes the disclosed sequence and conservative variations thereof. The term 
"conservative variation" as used herein denotes the replacement of an amino 
acid residue by another, biologically similar residue. Examples of conservative 
variations include the substitution of one hydrophobic residue such as 
isoleucine, valine, leucine or methionine for another, or the substitution of one 
polar residue for another, such as the substitution of arginine for lysine, 
glutamic for aspartic acid, or glutamine for asparagine, and the like. The term 
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"conservative variation" also includes the use of a substituted amino acid in 
place of an unsubstituted parent amino acid provided that antibodies raised to 
the substituted polypeptide also immunoreact with the unsubstituted polypep- 
tide. 

DNA sequences of the invention can be obtained by several methods. For 
example, the DNA can be isolated using hybridization techniques which are 
well known in the art. These include, but are not limited to: 1) hybridization of 
genomic or cDNA libraries with probes to detect homologous nucleotide 
sequences, 2) polymerase chain reaction (PCR) on genomic DNA or cDNA 
using primers capable of annealing to the DNA sequence of interest, and 3) 
antibody screening of expression libraries to detect cloned DNA fragments with 
shared structural features. 

Preferably the GDF-7 polynucleotide of the invention is derived from a 
mammalian organism, and most preferably from a mouse, rat, or human. 
Screening procedures which rely on nucleic add hybridization make it possible 
to isolate any gene sequence from any organism, provided the appropriate 
probe is available. Oligonucleotide probes, which correspond to a part of the 
sequence encoding the protein in question, can be synthesized chemically. 
This requires that short, oligopeptide stretches of amino acid sequence must 
be known. The DNA sequence encoding the protein can be deduced from the 
genetic code, however, the degeneracy of the code must be taken into 
account. It is possible to perform a mixed addition reaction when the 
sequence is degenerate. This includes a heterogeneous mixture of denatured 
double-stranded DNA. For such screening, hybridization is preferably 
performed on either single-stranded DNA or denatured double-stranded DNA. 
Hybridization is particularly useful in the detection of cDNA clones derived from 
sources where an extremely low amount of mRNA sequences relating to the 
polypeptide of interest are present. In other words, by using stringent 
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hybridizatiqn conditions directed to avoid non-specific binding, it is possible, 
lor example, to allow the autoradiographic visualization of a specific cDNA 
clone by the hybridization of the target DNA to that single probe in the mixture 
which is its complete complement (Wallace, ef a/., Nucl. Acid Res., 9:879, 
5 1981). 

The development of specific DNA sequences encoding GDF-7 can also be 
obtained by: 1) isolation of double-stranded DNA sequences from the genomic 
DNA; 2) chemical manufacture of a DNA sequence to provide the necessary 
codons for the polypeptide of interest; and 3) in vitro synthesis of a double- 
10 stranded DNA sequence by reverse transcription of mRNA isolated from a 
eukaryotic donor cell. In the latter case, a double-stranded DNA complement 
of mRNA is eventually formed which is generally referred to as cDNA. 

Of the three above-noted methods for developing specific DNA sequences for 
use in recombinant procedures, the isolation of genomic DNA isolates is the 
15 least common. This is especially true when it is desirable to obtain the 
microbial expression of mammalian polypeptides due to the presence of 
introns. 

The synthesis of DNA sequences is frequently the method of choice when the 
entire sequence of amino acid residues of the desired polypeptide product is 

20 known. When the entire sequence of amino acid residues of the desired 
polypeptide is not known, the direct synthesis of DNA sequences is not 
possible and the method of choice is the synthesis of cDNA sequences. 
Among the standard procedures for isolating cDNA sequences of interest is the 
formation of plasmid- or phage-carrying cDNA libraries which are derived from 

25 reverse transcription of mRNA which is abundant in donor cells that have a 
high level of genetic expression. When used in combination with polymerase 
chain reaction technology, even rare expression products can be cloned. In 
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those cases where significant portions of the amino acid sequence of the 
polypeptide are known, the production of labeled single or double-stranded 
DNA or RNA probe sequences duplicating a sequence putatively present in the 
target cDNA may be employed in DNA/DNA hybridization procedures which are 
carried out on cloned copies of the cDNA which have been denatured into a 
single-stranded form (Jay, ef a/., Nucl. Acid Res., H:2325, 1983). 

A cDNA expression library, such as lambda gt11, can be screened indirectly 
for GDF-7 peptides having at least one epitope, using antibodies specific for 
GDF-7. Such antibodies can be either polyclonal^ or monoclonally derived 
and used to detect expression product indicative of the presence of GDF-7 
cDNA. 

DNA sequences encoding GDF-7 can be expressed in vitro by DNA transfer 
into a suitable host cell. "Host cells" are cells in which a vector can be 
propagated and its DNA expressed. The term also includes any progeny of 
the subject host cell. It is understood that all progeny may not be identical to 
the parental cell since there may be mutations that occur during replication. 
However, such progeny are included when the term "host cell" is used. 
Methods of stable transfer, meaning that the foreign DNA is continuously 
maintained in the host, are known in the art. 

In the present invention, the GDF-7 polynucleotide sequences may be inserted 
into a recombinant expression vector. The term "recombinant expression 
vector" refers to a plasmid, virus or other vehicle known in the art that has 
been manipulated by insertion or incorporation of the GDF-7 genetic 
sequences. Such expression vectors contain a promoter sequence which 
facilitates the efficient transcription of the inserted genetic sequence of the host. 
The expression vector typically contains an origin of replication, a promoter, as 
well as specific genes which allow phenotypic selection of the transformed 
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cells. Vectors suitable for use in the present invention include, but are not 
limited to the T7-based expression vector for expression in bacteria 
(Rosenberg, et a/.. Gene. 56:125. 1987), the pMSXND expression vector for 
expression in mammalian cells (Lee and Nathans, J. Biol. Chem., 263:3521. 
5 1988) and baculovirus-derived vectors for expression in insect cells. The DNA 
segment can be present in the vector operably linked to regulatory elements, 
for example, a promoter (e.g., T7, metallothionein I, or polyhedrin promoters). 

Polynucleotide sequences encoding GDF-7 can be expressed in either 
prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and 

10 mammalian organisms. Methods of expressing DNA sequences having 
eukaryotic or viral sequences in prokaryotes are well known in the art. 
Biologically functional viral and plasmid DNA vectors capable of expression and 
replication in a host are known in the art. Such vectors are used to incorp- 
orate DNA sequences of the invention. Preferably, the mature C-terminal 

15 region of GDF-7 is expressed from a cDNA clone containing the entire coding 
sequence of GDF-7. Alternatively, the C-terminal portion of GDF-7 can be 
expressed as a fusion protein with the pro- region of another member of the 
TGF-£ family or co-expressed with another pro- region (see for example. 
Hammonds, et a/., Molec. Endocrin. 5:149, 1991; Gray, A, and Mason, A, 

20 Science, 247:1328. 1990). 

Transformation of a host cell with recombinant DNA may be carried out by 
conventional techniques as are well known to those skilled in the art. Where 
the host is prokaryotic. such as E. coli, competent cells which are capable of 
DNA uptake can be prepared from cells harvested after exponential growth 
25 phase and subsequently treated by the CaCI 2 method using procedures well 
known in the art. Alternatively, MgCI 2 or RbCI can be used. Transformation 
can also be performed after forming a protoplast of the host cell if desired. 
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When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate co-precipitates, conventional mechanical procedures such as 
microinjection, electroporation, insertion of a plasmid encased in liposomes, or 
virus vectors may be used. Eukaryotic cells can also be cotransformed with 
DNA sequences encoding the GDF-7 of the invention, and a second foreign 
DNA molecule encoding a selectable phenotype, such as the herpes simplex 
thymidine kinase gene. Another method is to use a eukaryotic viral vector, 
such as simian virus 40 (SV40) or bovine papilloma virus, to transiently infect 
or transform eukaryotic cells and express the protein, (see for example, 
Eukaryotic Viral Vectors. Cold Spring Harbor Laboratory, Gluzman ed.. 1982). 

Isolation and purification of microbial expressed polypeptide, or fragments 
thereof, provided by the invention, may be carried out by conventional means 
including preparative chromatography and immunological separations involving 
monoclonal or polyclonal antibodies. 

The invention includes antibodies immunoreactive with GDF-7 polypeptide or 
functional fragments thereof. Antibody which consists essentially of pooled 
monoclonal antibodies with different epitopic specificities, as well as distinct 
monoclonal antibody preparations are provided. Monoclonal antibodies are 
made from antigen containing fragments of the protein by methods well known 
to those skilled in the art (Kohler, et a/., Nature, 256:495, 1975). The term 
antibody as used in this invention is meant to include intact molecules as well 
as fragments thereof, such as Fab and F(ab') 2 . which are capable of binding 
an epitopic determinant on GDF-7. 

The term "cell-proliferative disorder" denotes malignant as well as non-malignant 
cell populations which often appear to differ from the surrounding tissue both 
morphologically and genotypically. Malignant cells (i.e. cancer) develop as a 
result of a multistep process. The GDF-7 polynucleotide that is an antisense 
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molecule is useful in treating malignancies of the various organ systems, 
particularly, for example, cells in neural tissue. Essentially, any disorder which 
is etiologically linked to altered expression of GDF-7 could be considered 
susceptible to treatment with a GDF-7 suppressing reagent. One such 
5 disorder is a malignant cell proliferative disorder, for example. 

The invention provides a method for detecting a cell proliferative disorder of 
neural tissue which comprises contacting an anti-GDF-7 antibody with a cell 
suspected of having a GDF-7 associated disorder and detecting binding to the 
antibody. The antibody reactive with GDF-7 is labeled with a compound which 

0 allows detection of binding to GDF-7. For purposes of the invention, an 
antibody specific for GDF-7 polypeptide may be used to detect the level of 
GDF-7 in biological fluids and tissues. Any specimen containing a detectable 
amount of antigen can be used. A preferred sample of this invention is neural 
tissue. The level of GDF-7 in the suspect cell can be compared with the level 

5 in a normal cell to determine whether the subject has a GDF-7-associated cell 
proliferative disorder. Preferably the subject is human. 

The antibodies of the invention can be used in any subject in which it is 
desirable to administer in vitro or in vivo immunodiagnosis or immunotherapy. 
The antibodies of the invention are suited for use, for example, in immuno- 

0 assays in which they can be utilized in liquid phase or bound to a solid phase 
carrier. In addition, the antibodies in these immunoassays can be detectably 
labeled in various ways. Examples of types of immunoassays which can utilize 
antibodies of the invention are competitive and non-competitive immunoassays 
in either a direct or indirect format. Examples of such immunoassays are the 

5 radioimmunoassay (RIA) and the sandwich (immunometric) assay. Detection 
of the antigens using the antibodies of the invention can be done utilizing 
immunoassays which are run in either the forward, reverse, or simultaneous 
modes, including immunohistochemical assays on physiological samples. 
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Those of skill in the art will know, or can readily discern, other immunoassay 
formats without undue experimentation. 

The antibodies of the invention can be bound to many different carriers and 
used to detect the presence of an antigen comprising the polypeptide of the 
invention. Examples of well-known carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran. nylon, amylases, natural and modified 
celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier 
can be either soluble or insoluble for purposes of the invention. Those skilled 
in the art will know of other suitable carriers for binding antibodies, or will be 
able to ascertain such, using routine experimentation. 

There are many different labels and methods of labeling known to those of 
ordinary skill in the art. Examples of the types of labels which can be used in 
the present invention include enzymes, radioisotopes, fluorescent compounds, 
colloidal metals, chemiluminescent compounds, phosphorescent compounds, 
and bioluminescent compounds. Those of ordinary skill in the art will know of 
other suitable labels for binding to the antibody, or will be able to ascertain 
such, using routine experimentation. 

Another technique which may also result in greater sensitivity consists of 
coupling the antibodies to low molecular weight haptens. These haptens can 
then be specifically detected by means of a second reaction. For example, it 
is common to use such haptens as biotin, which reacts with avidin, or 
dinitrophenyl, puridoxal, and fluorescein, which can react with specific anti- 
hapten antibodies. 

In using the monoclonal antibodies of the invention for the in vivo detection of 
antigen, the detectably labeled antibody is given a dose which is diagnostically 
effective. The term "diagnostically effective" means that the amount of 
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detectably labeled monoclonal antibody is administered in sufficient quantity to 
enable detection of the site having the antigen comprising a polypeptide of the 
invention for which the monoclonal antibodies are specific. 

The concentration of detectably labeled monoclonal antibody which is 
5 administered should be sufficient such that the binding to those cells having 
the polypeptide is detectable compared to the background. Further, it is 
desirable that the detectably labeled monoclonal antibody be rapidly cleared 
from the circulatory system in order to give the best target-to-background 
signal ratio. 

10 As a rule, the dosage of detectably labeled monoclonal antibody for in vivo 
diagnosis will vary depending on such factors as age, sex, and extent of 
disease of the individual. Such dosages may vary, for example, depending on 
whether multiple injections are given, antigenic burden, and other factors 
known to those of skill in the art. 

15 For in vivo diagnostic imaging, the type of detection instrument available is a 
major factor in selecting a given radioisotope. The radioisotope chosen must 
have a type of decay which is detectable for a given type of instrument. Still 
another important factor in selecting a radioisotope for in vivo diagnosis is that 
deleterious radiation with respect to the host is minimized. Ideally, a radio- 

20 isotope used for in vivo imaging will lack a particle emission, but produce a 
large number of photons in the 140-250 keV range, which may readily be 
detected by conventional gamma cameras. 

For in vivo diagnosis radioisotopes may be bound to immunoglobulin either 
directly or indirectly by using an intermediate functional group. Intermediate 
25 functional groups which often are used to bind radioisotopes which exist as 
metallic ions to immunoglobulins are the Afunctional chelating agents such as 
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diethylenetriaminepentacetic acid (DTPA) and ethylenediaminetetraacetic acid 
(EDTA) and similar molecules. Typical examples of metallic ions which can be 
bound to the monoclonal antibodies of the invention are 111 ln, 97 Ru, 67 Ga, ^Ga, 
^As, 89 Zr. and ^Tl. 

5 The monoclonal antibodies of the invention can also be labeled with a 
paramagnetic isotope for purposes of in vivo diagnosis, as in magnetic 
resonance imaging (MRI) or electron spin resonance (ESR). In general, any 
conventional method for visualizing diagnostic imaging can be utilized. Usually 
gamma and positron emitting radioisotopes are used for camera imaging and 
10 paramagnetic isotopes for MRI. Elements which are particularly useful in such 
techniques include 157 Gd, ^Mn, 162 Dy, ^Cr, and ^Fe. 

The monoclonal antibodies of the invention can be used in vitro and in vivo to 
monitor the course of amelioration of a GDF-7-associated disease in a subject. 
Thus, for example, by measuring the increase or decrease in the number of 

15 cells expressing antigen comprising a polypeptide of the invention or changes 
in the concentration of such antigen present in various body fluids, it would be 
possible to determine whether a particular therapeutic regimen aimed at 
ameliorating the GDF-7-associated disease is effective. The term "ameliorate" 
denotes a lessening of the detrimental effect of the GDF-7-associated disease 

20 in the subject receiving therapy. 

The present invention identifies a nucleotide sequence that can be expressed 
in an altered manner as compared to expression in a normal cell, therefore it 
is possible to design appropriate therapeutic or diagnostic techniques directed 
to this sequence. Thus, where a cell-proliferative disorder is associated with 
25 the expression of GDF-7, nucleic acid sequences that interfere with GDF-7 
expression at the translational level can be used. This approach utilizes, for 
example, antisense nucleic acid and ribozymes to block translation of a specific 
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GDF-7 mRNA, either by masking that mRNA with an antisense nucleic acid or 
by cleaving it with a ribozyme. 

Antisense nucleic acids are DNA or RNA molecules that are complementary to 
at least a portion of a specific mRNA molecule (Weintraub, Scientific American. 
262:40, 1990). In the cell, the antisense nucleic acids hybridize to the 
corresponding mRNA. forming a double-stranded molecule. The antisense 
nucleic acids interfere with the translation of the mRNA. since the cell will not 
translate a mRNA that is double-stranded. Antisense oligomers of about 15 
• nucleotides are preferred, since they are easily synthesized and are less likely 
to cause problems than larger molecules when introduced into the target GDF- 
7-producing cell. The use of antisense methods to inhibit the in vitro 
translation of genes is well known in the art (Marcus-Sakura. AnaLBiochem., 
172:289, 1988). 

Ribozymes are RNA molecules possessing the ability to specifically cleave 
other single-stranded RNA in a manner analogous to DNA restriction 
endonucleases. Through the modification of nucleotide sequences which 
encode these RNAs. it is possible to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech. JAmer.Med. 
Assn., 260:3030. 1988). A major advantage of this approach is that, because 
they are sequence-specific, only mRNAs with particular sequences are 
inactivated. 

There are two basic types of ribozymes namely, tetrahymena-type (Hasselhoff. 
Nature, 334:585. 1988) and M hammerhead"-type. Tetrahymena-type ribozymes 
recognize sequences which are four bases in length, while "hammerhead"-type 
ribozymes recognize base sequences 11-18 bases in length. The longer the 
recognition sequence, the greater the likelihood that the sequence will occur 
exclusively in the target mRNA species. Consequently, hammerhead-type 
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ribozymes are preferable to tetrahymena-type ribozymes for inactivating a 
specific mRNA species and 18-based recognition sequences are preferable to 
shorter recognition sequences. 

The present invention also provides gene therapy for the treatment of cell 
proliferative or immunologic disorders which are mediated by GDF-7 protein. 
Such therapy would achieve its therapeutic effect by introduction of the GDF-7 
antisense polynucleotide into cells having the proliferative disorder. Delivery of 
antisense GDF-7 polynucleotide can be achieved using a recombinant expres- 
sion vector such as a chimeric virus or. a colloidal dispersion system. 
Especially preferred for therapeutic delivery of antisense sequences is the use 
of targeted liposomes. 

Various viral vectors which can be utilized for gene therapy as taught herein 
include adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such 
as a retrovirus. Preferably, the retroviral vector is a derivative of a murine or 
avian retrovirus. Examples of retroviral vectors in which a single foreign gene 
can be inserted include, but are not limited to: Moloney murine leukemia virus 
(MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor 
virus (MuMTV), and Rous Sarcoma Virus (RSV). A number of additional 
retroviral vectors can incorporate multiple genes. All of these vectors can 
transfer or incorporate a gene for a selectable marker so that transduced cells 
can be identified and generated. By inserting a GDF-7 sequence of interest 
into the viral vector, along with another gene which encodes the ligand for a 
receptor on a specific target cell, for example, the vector is now target specific. 
Retroviral vectors can be made target specific by inserting, for example, a 
polynucleotide encoding a sugar, a glycolipid, or a protein. Preferred targeting 
is accomplished by using an antibody to target the retroviral vector. Those of 
skill in the art will know of, or can readily ascertain without undue experimenta- 
tion, specific polynucleotide sequences which can be inserted into the retroviral 
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genome to allow target specific delivery of the retroviral vector containing the 
GDF-7 antisense polynucleotide. 

Since recombinant retroviruses are defective, they require assistance in order 
to produce infectious vector particles. This assistance can be provided, for 
5 example, by using helper cell lines that contain plasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences 
within the LTR. These plasmids are missing a nucleotide sequence which 
enables the packaging mechanism to recognize an RNA transcript for 
encapsidation. Helper cell lines which have deletions of the packaging signal 
10 include, but are not limited to *2, PA317 and PA12, for example. These cell 
lines produce empty virions, since no genome is packaged. If a retroviral 
vector is introduced into such cells in which the packaging signal is intact, but 
the structural genes are replaced by other genes of interest, the vector can be 
packaged and vector virion produced. 

15 Alternatively, NIH 3T3 or other tissue culture cells can be directly transfected 
with plasmids encoding the retroviral structural genes gag, pol and env t by 
conventional calcium phosphate transfection. These cells are then transfected 
with the vector plasmid containing the genes of interest The resulting cells 
release the retroviral vector into the culture medium. 

20 Another targeted delivery system for GDF-7 antisense polynucleotides is a 
colloidal dispersion system. Colloidal dispersion systems include macromole- 
cule complexes, nanocapsules, microspheres, beads, and lipid-based systems 
including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The 
preferred colloidal system of this invention is a liposome. Liposomes are 

25 artificial membrane vesicles which are useful as delivery vehicles in vitro and 
in vivo. It has been shown that large unilamellar vesicles (LUV), which range 
in size from 0.2-4.0 ^m can encapsulate a substantial percentage of an 
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aqueous buffer containing large macromolecules. RNA, DNA and intact virions 
can be encapsulated within the aqueous interior and be delivered to cells in a 
biologically active form (Fraley. et a/. Trends Biochem. Sci., 6.:77, 1981). In 
addition to mammalian cells, liposomes have been used for delivery of 
polynucleotides in plant, yeast and bacterial cells. In order for a liposome to 
be an efficient gene transfer vehicle, the following characteristics should be 
present: (1) encapsulation of the genes of interest at high efficiency while not 
compromising their biological activity; (2) preferential and substantial binding 
to a target cell in comparison to non-target cells; (3) delivery of the aqueous 
contents of the vesicle to the target cell cytoplasm at high efficiency; and (4) 
accurate and effective expression of genetic information (Mannino, et al., 
Biotechniques, 6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, 
particularly high-phase-transition-temperature phospholipids, usually in 
combination with steroids, especially cholesterol. Other phospholipids or other 
lipids may also be used. The physical characteristics of liposomes depend on 
pH, ionic strength, and the presence of divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl 
compounds, such as phosphatidylglycerol, phosphatidylcholine, 
phosphatidylserine.phosphatidylethanolamine.sphingolipids.cerebrosides.and 
gangliosides. Particularly useful are diacylphosphatidylglycerols. where the lipid 
moiety contains from 14-18 carbon atoms, particularly from 16-18 carbon 
atoms, and is saturated. Illustrative phospholipids include egg phosphatidyl- 
choline, dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine. 

The targeting of liposomes can be classified based on anatomical and 
mechanistic factors. Anatomical classification is based on the level of 
selectivity, for example, organ-specific, cell-specific, and organelle-specific. 
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Mechanistic targeting can be distinguished based upon whether it is passive 
or active. Passive targeting utilizes the natural tendency of liposomes to 
distribute to cells of the reticuloendothelial system (RES) in organs which 
contain sinusoidal capillaries. Active targeting, on the other hand, involves 
alteration of the liposome by coupling the liposome to a specific ligand such 
as a monoclonal antibody, sugar, glycolipid. or protein, or by changing the 
composition or size of the liposome in order to achieve targeting to organs and 
cell types other than the naturally occurring sites of localization. 

The surface of the targeted delivery system may be modified in a variety of 
ways. In the case of a liposomal targeted delivery system, lipid groups can be 
incorporated into the lipid bilayer of the liposome in order to maintain the 
targeting ligand in stable association with the liposomal bilayer. Various linking 
groups can be used for joining the lipid chains to the targeting ligand. 

Due to the expression of GDF-7 in neural tissue, there are a variety of 
applications using the polypeptide, polynucleotide, and antibodies of the 
invention, related to this tissue. Such applications include treatment of cell 
proliferative disorders involving this tissue. In addition, GDF-7 may be useful 
in various gene therapy procedures. 

The following examples are intended to illustrate but not limit the invention. 
While they are typical of those that might be used, other procedures known to 
those skilled in the art may alternatively be used. 
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10 



15 



20 



EXAMPLE 1 

IDENTIFICA TION AND ISOLATION OF A mox/pi 
TGF-fl FAMILY MEMRFR 

To identify a new member of the TGF-* superfamily, degenerate 
oligonucleotides were designed which corresponded to two conserved regions 
among the known family members: one region spanning the two tryptophan 
residues conserved in all family members except MIS and the other region 
spanning the invariant cysteine residues near the C-terminus. These primers 
were used for polymerase chain reactions on mouse genomic DNA followed 
by subcloning the PCR products using restriction sites placed at the 5' ends 
of the primers, picking individual E. coli colonies carrying these subcloned 
inserts, and using a combination of random sequencing and hybridization 
analysis to eliminate known members of the superfamily. 

GDF-7 was identified from a mixture of PCR products obtained with the primers 
SJL141 : 5 , -CCGGAATTCGGITGG(G/C/A)A(G/AvT/C)(A/G)A(T/C)TGG(A/G)TI 
(A/G)TI(T/G)CICC-3' (SEQ ID NO:1) 

SJL146: 5'CCGGAATTC(G/A)CAI(G/C)C(G/A)CAIG(C/A)(G/A>T/C)C(G/T)IACI 
(G/A)(T/C)CAT-3' (SEQ ID NO:2) 



PCR using these primers was carried out with 2 „g mouse genomic DNA at 
94-C for 1 min, 50'C for 2 min, and 72'C for 2 min for 40 cycles. 



PCR products of approximately 280 bp were gel-purified, digested with Eco Rl, 
gel-purified again, and subcloned in the Bluescript vector (Stratagene, San 
Diego. CA). Bacterial colonies carrying individual subclones were picked into 
96 well microtiter plates, and multiple replicas were prepared by plating the 
25 cells onto nitrocellulose. The replicate filters were hybridized to probes 
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representing known members of the family, and DNA was prepared from non- 
hybridizing colonies for sequence analysis. 

The primer combination of SJL141 and SJL146, encoding the amino acid 
sequences GW(H/Q/N/K/D/E)(D/N)W(V/I/M)(V/I/M)*(A/S)P (SEQ ID NO:3) and 
5 M(V/I/M/T/A)V(R/S)(A/S)C(G/A)C (SEQ ID NO:4), respectively, yielded five 
previously identified sequences (BMP-2, BMP-4, inhibin £B, GDF-3 and GDF-5) 
and one novel sequence, which was designated GDF-7, among 147 subclones 
analyzed. 

EXAMPLE 2 

10 EXPRESSION PATTERN AND SEQUENCE OF GDF-7 

To determine the expression pattern of GDF-7, RNA samples prepared from 
a variety of tissues were screened by Northern analysis and RNase protection. 
As shown in Figure 1 , GDF-7 mRNA was detected in fetal and neonatal brain 
and in the Neuro 2A neuroblastoma cell line. 

15 To obtain a larger segment of the GDF-7 gene, a mouse genomic library was 
screened with a probe derived from the GDF-7 PCR product. The partial 
sequence of a GDF-7 genomic clone is shown in Figure 2a. The sequence 
contains an open reading frame corresponding to the predicted C-terminal 
region of the GDF-7 precursor protein. The predicted GDF-7 sequence 

20 contains a potential proteolytic processing site, which is boxed. Cleavage of 
the precursor at this site would generate a mature C-terminal fragment 146 
amino acids in length with a predicted molecular weight of 14,900. 

The C-terminal region of GDF-7 following the putative proteolytic processing 
site shows significant homology to the known members of the TGF-0 
25 superfamily (Figure 3). Figure 3 shows the alignment of the C-terminal 
sequences of GDF-7 with the corresponding regions of human GDF-1 (Lee, 
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Proc. Natl. Acad. Sci. USA, £8:4250-4254, 1991), human BMP-2 and 4 
(Wozney, et aL, Science, 242:1528-1534, 1988), human Vgr-1 (Celeste, etal., 
Proc. Natl. Acad. Sci. USA, 87:9843-9847, 1990). human OP-1 (Ozkaynak, et 
a/., EMBO J., 9:2085-2093, 1990), human BMP-5 (Celeste, et aL, Proc. Natl. 
5 Acad. Sci. USA. 87:9843-9847, 1990). human BMP-3 (Wozney, et aL, Science, 
242:1528-1534, 1988). human MIS (Cate. etal., Cell, 45:685-698, 1986). human 
inhibin alpha. j)A, and (Mason, ef al., Biochem, Biophys. Res. Commun.. 
135:957-964. 1986). human TGF-/M (Derynck. et aL, Nature, 216:701-705. 
1985). humanTGF-/?2 (deMartin. et aL, EMBO J., 6:3673-3677. 1987). and 
10 human TGF-/J3 (ten Dijke, et aL, Proc. Natl. Acad. Sci. USA, 85:4715-4719, 
1988). The conserved cysteine residues are boxed. Dashes denote gaps 
introduced in order to maximize the alignment. 

GDF-7 contains most of the residues that are highly conserved in other family 
members, including the seven cysteine residues with their characteristic 
15 spacing. 

FIGURE 4 shows the amino acid homologies among the different members of 
the TGF-0 superfamily. Numbers represent percent amino acid identities 
between each pair calculated from the first conserved cysteine to the C- 
terminus. Boxes represent homologies among highly-related members within 
20 particular subgroups. In this region, GDF-7 is most homologous to BMP-2 and 
BMP-4 (57% sequence identity). 

Although the invention has been described with reference to the presently 
preferred embodiment, it should be understood that various modifications can 
be made without departing from the spirit of the invention. Accordingly, the 
25 invention is limited only by the following claims. 
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SUMMARY OF SEQUENCES 

SEQ ID NO: 1 is the nucleotide sequence for the GDF-7 primer, SJL141. 

SEQ ID NO: 2 is the nucleotide sequence for the GDF-7 primer, SJL146. 

SEQ ID NO: 3 is the amino acid sequence for the GDF-7 primer, SJL141. 

5 SEQ ID NO: 4 is the amino acid sequence for the GDF-7 primer, SJL146. 

SEQ ID NO: 5 is the nucleotide and deduced amino acid sequence for the 
GDF-7. 

SEQ ID NO: 6 is the amino acid sequence for the GDF-7. 

SEQ ID NO: 7 is the amino acid sequence for the C-terminal end of GDF-7. 
10 SEQ ID NO: 8 is the amino acid sequence for the C-terminal end of GDF-1. 

SEQ ID NO: 9 is the amino acid sequence for the C-terminal end of BMP-2. 

SEQ ID NO: 10 is the amino acid sequence for the C-terminal end of BMP-4. 

SEQ ID NO: 1 1 is the amino acid sequence for the C-terminal end of Vgr-1. 

SEQ ID NO: 12 is the amino acid sequence for the C-terminal end of OP-1. 
15 SEQ ID NO: 13 is the amino acid sequence for the C-terminal end of BMP-5. 

SEQ ID NO: 14 is the amino acid sequence for the C-terminal end of BMP-3. 
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SEQ ID NO: 15 is the amino acid sequence for the C-terminal end of MIS. 

SEQ ID NO: 1 6 is the amino acid sequence for the C-terminal end of Inhibin- 
alpha. 

SEQ ID NO: 17 is the amino acid sequence for the C-terminal end of Inhibin- 
5 beta-alpha. 

SEQ ID NO: 18 is the amino acid sequence for the C-terminal end of Inhibin- 
beta-beta. 

SEQ ID NO: 19 is the amino acid sequence for the C-terminal end of TGF-beta- 
1. 

0 SEQ ID NO: 20 is the amino acid sequence for the C-terminal end of TGF-beta- 
2. 

SEQ ID NO: 21 is the amino acid sequence for the C-terminal end of TGF-beta- 
3. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: THE JOHNS HOPKINS UNIVERSITY SCHOOL OF MEDICINE 
(ii) TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-7 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Spensley Horn Jubas & Lubitz 

(B) STREET: 1880 Century Park East, Suite 500 

(C) CITY: Los Angeles 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 90067 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1.25 

(vi) CURRENT APPLICATION DATA: 



5 



(iii) 



NUMBER OF SEQUENCES: 21 



20 



(A) APPLICATION NUMBER: PCT 

(B) FILING DATE: 08- JUL- 1994 

(C) CLASSIFICATION: 



(viii) 



ATTORNEY/AGENT INFORMATION: 

(A) NAME: TUMARKIN, LISA A, t PH.D. 

(B) REGISTRATION NUMBER: P-38,347 

(C) REFERENCE/DOCKET NUMBER: FD-2348 



25 



(ix) 



TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 455-5100 

(B) TELEFAX: (619) 455-5110 



30 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL141 

FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .35 

(D) OTHER INFORMATION: /note- "R-adenine or guanine; 

S-cytosine or guanine; M-adenine or cytosinej 
N-adenine, cytosine, guanine or thymine; 
K-thymine or guanine; B-inosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCGGAATTCG GBTGGVANRA YTGGRTBRTB KCBCC 
15 (2) INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL1A6 

FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..33 

(D) OTHER INFORMATION: /note- "R-adenine or guanine; 

S-cytosine or guanine; M-adenine or cytosine; 
N-adenine, cytosine, guanine or thymine; 
Y-cytosine or thymine; K-thymine or guanine; 
B-inosine H 



(ix) 



10 



(i) 



20 



25 



30 



(ix) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCGGAATTCR CABSCRCABG MNCKBACBRY CAT 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



10 (vii) IMMEDIATE SOURCE: 

(B) CLONE: SJL141 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..9 

15 (D) OTHER INFORMATION: /note- "His-His, Gin, Asn, Lys, Asp 

or Glu; Asp-Asp or Asn; Val-Val, lie or Met; 
Ala-Ala or Ser" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Gly Trp His Asp Trp Val Val Ala Pro 
20 1 5 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL146 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .8 

(D) OTHER INFORMATION: /note- "Val, 

5 or 

Ala; Arg-Arg or Ser; Ala-Ala 
Ala H 



position 1-Val, lie, Met, Th: 
or Ser; Gly-Gly or 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Val Val Arg Ala Cys Gly Cys 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(i) 

15 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: GDF-7 

20 (ix) FEATURE: 

<A) NAME/KEY: CDS 
(B) LOCATION: 34.. 516 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GCTGCAGAGC CGCCACCGGT ACCAGGACCA GGC GCT GGG TCA CGC AAA GCC AAC 54 

Ala Gly Ser Arg Lys Ala Asn 
1 5 



CTG GGC GGT CGC AGG CGG CGG CGG ACT GCG CTG GCT GGG ACT CGG GGA 
Leu Gly Gly Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Gly 
10 15 20 



102 
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GCG CAC GGA AGC GGT GGT GGC CGC GGT GGC GGT GGC GGC GGC GGC GGC 
Ala Gin Gly Ser Gly Gly Gly Cly Gly Gly Gly Gly Gly Gly Gly Gly 
25 30 35 

GGC GGC GGC GGC GGC GGC GGC GGC GCA GGC AGG GGC CAC GGG CGC AGA 
5 Gly Gly Gly Gly Gly Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg 

AO 45 50 55 

GGC CGG AGC CGC TGC AGT CGC AAG TCA CTG CAC GTG GAC TTT AAG GAG 
Gly Arg Ser Arg Cys Ser Arg Lys Ser Leu His Val Asp Phe Lys Glu 
60 65 70 • 

10 CTG GGC TGG GAC GAC TGG ATC ATC GCG CCA TTA GAC TAC GAG GCA TAC 

Leu Gly Trp Asp Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr 
75 80 85 ■ 

CAC TGC GAG GGC GTT TGC GAC TTT CCT CTG CGC TCG CAC CTG GAG CCT 
His Cys Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro 
15 90 95 100 

ACC AAC CAC GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG GCG CCC GAC 
Thr Asn His Ala He He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp 
105 HO us 

GCT GCG CCA GCC TCC TGC TGC GTG CCC GCA AGG CTC AGT CCC ATC AGC 
Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser 
120 125 130 135 

ATT CTC TAC ATC GAT GCC GCC AAC AAC GTG GTC TAC AAG CAG TAC GAA 
He Leu Tyr He Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu 
1*0 145 " 150 

25 GAC ATG GTG GTG GAG GCC TGC GGC TGC AGG TAG 

Asp Met Val Val Glu Ala Cys Gly Cys Arg 
155 160 



20 



150 



198 



246 



294 



342 



390 



438 



486 



519 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Cly Ser Arg Lys Ala Asn Leu Cly Cly Arg Arg Arg Arg Arg Thr 
1 5 10 is 

Ala Leu Ala Gly Thr Arg Gly Ala Gin Gly Ser Gly Gly Gly Gly Gly 
5 20 25 " 30 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly. Gly Gly Gly Ala 
35 40 45 

Gly Arg Gly His Gly Arg Arg Cly Arg Ser Arg Cys Ser Arg Lys Ser 
50 55 60 

10 Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp Asp Trp He He Ala 
65 70 75 g0 

Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro 
85 90 95 

Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu 
15 100 105 no 

Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser Cys Cys Val Pro 
115 120 125 

Ala Arg Leu Ser Pro He Ser He Leu Tyr He Asp Ala Ala Asn Asn 
130 135 140 

20 Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ala Cys Gly Cys 
145 150 155 160 

Arg 

(2) INFORMATION FOR SEQ ID NO: 7: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: procein 
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(vii) IMMEDIATE SOURCE: 
<B) CLONE: CDF- 7 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
5 (B) LOCATION: 1. .119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Gly Cly Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser 
1 5 10 15 

Arg Cys Ser Arg Lys Ser Leu His Val Asp Phe Lys Glu Leu Gly Trp 
10 20 25 * 30 

Asp Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu 
35 40 45 

Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His 
50 55 60 

15 Ala He He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro 

65 70 75 80 

Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser He Leu Tyr 
85 90 * 95 

He Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val 
20 100 105 HO 

Val Glu Ala Cys Gly Cys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: CDF-1 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
5 (B) LOCATION: 1..123 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Arg Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 
1 5 io " ' 15 

Ala Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp 
10 20 25 30 

His Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin 
35 40 45 

Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro 
50 55 60 

15 Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro 

65 70 75 80 

Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro He 
85 90 " 95 

Ser Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr 
20 100 105 no 

Glu Asp Met Val Val Asp Glu Cys Gly Cys Arg 
115 120 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP-2 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
5 (B) LOCATION: I. .118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Arg Glu Lys Arg Gin Ala Lys His Lys Gin Arg Lys Arg Lea Lys Ser 
1 5 10 15 

Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp 
10 20 25 30 

Asn Asp Trp He Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His 
35 40 45 

Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His 
50 .55 60 

15 Ala Ile Va l Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys 

65 70 75 80 

Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu * 
85 90 95 

Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val 
20 100 10 5 no 

Glu Gly Cys Gly Cys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



W ° 95/01802 PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP-4 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
5 (B) LOCATION: 1..118 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys 
1 5 io 15 

Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Tn> 
10 20 25 30 

Asn Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His 
35 40 45 

Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His 
50 55 60 

15 Ala He Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys 

65 70 75 80 

Ala Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Mec Leu Tyr Leu 
85 90 95 

Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu MeC Val Val 
20 100 105 110 

Glu Gly Cys Gly Cys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: Vgr-1 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .119 



-38- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ser Arg Gly Ser Gly Ser Ser Asp Tyr Asn Gly Ser Glu Leu Lys Thr 
1 5 10 15 

Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp 
10 20 25 30 

Gin Asp Trp He He Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp 
35 40 45 

Gly Glu Cys Ser Phe Pro Leu Asn Ala His Met. Asn Ala Thr Asn His 
50 55 60 

15 Ala He Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro 

65 70 75 80 

Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr 
85 90 95 

Phe Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val 
20 100 105 HO 

Val Arg Ala Cys Gly Cys His 
115 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: OP-1 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gin Arg Gin 
1 5 10 15 

Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp 
10 20 25 30 

Gin Asp Trp He He Ala Pro Glu Gly Tyr . Ala Ala Tyr Tyr Cys Glu 
35 40 45 

Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr Asn His 
50 55 60 

15 Ala lie Val Gin Thr Leu Val His Phe He Asn Pro Glu Thr Val Pro 

65 70 75 80 

Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser Val Leu Tyr 
85 90 95 

Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val 
20 100 105 * * no 

Val Arg Ala Cys Gly Cys His 
115 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP -5 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
5 (B) LOCATION: 1..119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Ser Arg Met Ser Ser Val Gly Asp Tyr Asn Thr Ser Glu Gin' Lys Gin 
1 5 .10 15 

Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp 
10 20 25 30 

Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp 
35 40 45 

Gly Glu Cys Ser Phe Pro Leu Asn Ala His Mec Asn Ala Thr Asn His 
' 50 55 60 

15 Ala He Val Gin Thr Leu Val His Leu Mec Phe Pro Asp His Val Pro 

^5 70 75 80 

Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr 
85 90 95 

Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val 
20 100 105 no 

Val Arg Ser Cys Gly Cys His 
115 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: BMP- 3 

(ix) FEATURE: 

(A) NAME/KEY: Procein 
5 (B) LOCATION: 1..120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Gin Thr Leu Lys Lys Ala Arg Arg Lys Gin Trp lie Glu Pro Arg 
1 5 10 15 

Asn Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp 
10 20 25 30 

Ser Glu Trp He He Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser 
35 40 45 

Gly Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His 
50 55 60 

15 Ala Thr He Gin Ser He Val Arg Ala Val Gly Val Val Pro Gly He 

65 70 75 80 

Pro Glu Pro Cys Cys Val Pro Glu Lys Met Ser Ser Leu Ser He Leu 
85 90 95 

Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met 
20 100 105 HO 

Thr Val Glu Ser Cys Ala Cys Arg 
115 120 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: MIS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: L. .116 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Pro Gly Arg Ala Gin Arg Ser Ala Gly Ala Thr Ala Ala Asp Gly 
15 10 15 

Pro Cys Ala Leu Arg Glu Leu Ser Val Asp Leu Arg Ala Glu Arg Ser 
10 20 25 30 

Val Leu He Pro Glu Thr Tyr Gin Ala Asn Asn Cys Gin Gly Val Cys 
35 40 45 

Gly Trp Pro Gin Ser Asp Arg Asn Pro Arg Tyr Gly Asn His Val Val 
50 55 60 

15 Leu Leu Leu Lys Met Gin Ala Arg Gly Ala Ala Leu Ala Arg Pro Pro 

65 70 75 " 80 

Cys Cys Val Pro Thr Ala Tyr Ala Gly Lys Leu Leu He Ser Leu Ser 
85 90 95 

Glu Glu Arg He Ser Ala His His Val Pro Asn Met Val Ala Thr Glu 
20 100 105 HO 

Cys Gly Cys Arg 
115 

(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Inhib in- alpha 

(ix) FEATURE: 

(A) NAME/KEY: Procein 

(B) LOCATION: 1. .122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Ala Leu Arg Leu Leu Gin Arg Pro Pro Glu Glu Pro Ala Ala His Ala 
1 5 10 15 

Asn Cys His Arg Val Ala Leu Asn He Ser Phe Gin Glu Leu Gly Trp 
10 20 25 30 

Glu Arg Trp He Val Tyr Pro Pro Ser Phe He Phe His Tyr Cys His 
35 40 45 

Gly Gly Cys Gly Leu His He Pro Pro Asn Leu Ser Leu Pro Val Pro 
50 55 60 

15 G1 y Aia Pro Pro Thr Pro Ala Gin Pro Tyr Ser Leu Leu Pro Gly Ala 

65 70 75 80 

Gin Pro Cys Cys Ala Ala Leu Pro Gly Thr Met Arg Pro Leu His Val 
85 90 95 

Arg Thr Thr Ser Asp Gly Gly Tyr Ser Phe Lys Tyr Glu Thr Val Pro 
20 100 105 ' no 

Asn Leu Leu Thr Gin His Cys Ala Cys He 
115 120 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Inhibin-beta-alpha 

(ix) FEATURE: 

(A) NAME/KEY : Protein 

(B) LOCATION: 1..122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Arg Arg Arg Arg Arg Gly Leu Glu Cys Asp Gly Lys Val Asn He 
1 5 10 15 

Cys Cys Lys Lys Gin Phe Phe Val Ser Phe Lys Asp He Gly Trp Asn 
10 20 25 30 

Asp Trp He He Ala Pro Ser Gly Tyr His Ala Asn Tyr Cys Glu Gly 
35 40 45 

Glu Cys Pro Ser His He Ala Gly Thr Ser Gly Ser Ser Leu Ser Phe 
50 55 60 

15 His Ser Thr Val He Asn His Tyr Arg Met Arg Gly His Ser Pro Ph 

65 70 75 80 

Ala Asn Leu Lys Ser Cys Cys Val Pro Thr Lys Leu Axg Pro Met S r 
85 90 95 

Met Leu Tyr Tyr Asp Asp Gly Gin Asn He He Lys Lys Asp He Gin 
20 100 105 HO 

Asn Met He Val Glu Glu Cys Gly Cys Ser 
115 120 

(2) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Inhibin-beca-beta 

(ix) FEATURE : 

(A) NAME/KEY: Protein 

(B) LOCATION: I.. 121 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



His Arg He Arg Lys Arg Gly Leu Glu Cys Asp Gly Arg Thr Asn Leu 
1 5 10 15 

Cys Cys Arg Gin Gin Phe Phe He Asp Phe Arg Leu He Gly Trp Asn 
10 20 25 30 

Asp Trp He He Ala Pro Thr Gly Tyr Tyr Gly Asn Tyr Cys Glu Gly 
35 40 45 

Ser Cys Pro Ala Tyr Leu Ala Gly Val Pro Gly Ser Ala Ser Ser Phe 
50 55 60 

15 His Thr Ala Va * v *l Asn Gin Tyr Arg Met Arg Gly Leu Asn Pro Gly 

65 70 75 80 

Thr Val Asn Ser Cys Cys He Pro Thr Lys Leu Ser Thr Met Ser Met 
85 90 " 95 

Leu Tyr Phe Asp Asp Glu Tyr Asn He Val Lys Arg Asp Val Pro Asn 
20 100 105 ^ no 

Met He Val Glu Glu Cys Gly Cys Ala 
115 120 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beCa-1 

(ix) FEATURE: 

(A) NAME/KEY: Procein 

(B) LOCATION: 1..1L5 



-46- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

His Arg Arg Ala Leu Asp Thr Asn Tyr Cys Phe Ser Ser Thr Glu Lys 
1 5 10 15 

Asn Cys Cys Val Arg Gin Leu Tyr He Asp Phe Arg Lys Asp Leu Gly 
10 20 25 30 

Trp Lys Trp He His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu 
35 40 45 

Gly Pro Cys Pro Tyr He Trp Ser Leu Asp Thr Gin Tyr Ser Lys Val 
50 55 60 

15 Leu Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser Ala Ala Pro Cys 

65 70 75 80 

Cys Val Pro Gin Ala Leu Glu Pro Leu Pro He Val Tyr Tyr Val Gly 
85 90 95 

Arg Lys Pro Lys Val Glu Gin Leu Ser Asn Met He Val Arg Ser Cys 
20 100 105 no 

Lys Cys Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCT/US94/07799 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beta-2 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .115 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Lys Arg Ala Leu Asp Ala Ala Tyr Cys Phe Arg Asn Val Gin Asp 
1 5 10 15 

Asn Cys Cys Leu Arg Pro Leu Tyr He Asp Phe Lys Arg Asp Leu Gly 
10 20 25 30 

Trp Lys Trp He His Glu Pro Lys Gly Tyr Asn Ala Asn Phe Cys Ala 
35 40 45 

Gly Ala Cys Pro Tyr Leu Trp Ser Ser Asp Thr Gin His Ser Arg Val 
50 55 60 

15 Leu Ser Leu Tyr Asn Thr He Asn Pro Glu Ala Ser Ala Ser Pro Cys 

65 70 75 80 

Cys Val Ser Gin Asp Leu Glu Pro Leu Thr He Leu Tyr Tyr' He Gly 
85 90 95 

Lys Thr Pro Lys He Glu Gin Leu Ser Asn Met He Val Lys Ser Cys 
20 100 105 HO 

Lys Cys Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 95/01802 



PCTAJS94/07799 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: TGF-beta-3 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
5 (B) LOCATION: 1. .115 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Lys Lys Arg Ala Leu Asp Thr Asn Tyr Cys Phe Arg Asn Leu Glu Glu 
1 5 10 15 



Asn Cys Cys Val Arg Pro Leu Tyr He Asp Phe Arg Gin Asp Leu Gly 

10 20 25 30 

Trp Lys Trp Val His Glu Pro Lys Gly Tyr Tyr Ala Asn Phe Cys S r 
35 40 45 

Gly Pro Cys Pro Tyr Leu Arg Ser Ala Asp Thr Thr His Ser Thr Val 

50 55 60 

15 Leu Gly Leu Tyr Asn Thr Leu Asn Pro Glu Ala Ser Ala Ser Pro Cys 

65 70 75 80 

Cys Val Pro Gin Asp Leu Glu Pro Leu Thr He Leu Tyr Tyr Val Gly 
85 90 95 

Arg Thr Pro Lys Val Glu Gin Leu Ser Asn Met Val Val Lys Ser Cys 

20 100 105 HO 



Lys Cys Ser 
115 



